Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Fine-grained emotion classification of Chinese microblog based on syntactic dependency graph
Cheng FANG, Bei LI, Ping HAN, Qiong WU
Journal of Computer Applications    2023, 43 (4): 1056-1061.   DOI: 10.11772/j.issn.1001-9081.2022030469
Abstract276)   HTML16)    PDF (1598KB)(174)       Save

Emotion analysis can quickly and accurately dig out users’ emotional tendencies, and has a huge application market. Aiming at the complexity and diversity of the microblog language’s syntactic structures, a Syntax Graph Convolution Network (SGCN) model was proposed for fine-grained emotion classification of Chinese microblog. The proposed model has the characteristics of rich structural and semantic expression at the same time. In the model, a text graph was constructed on the basis of the dependency between words, and the correlation degree between words was quantified by Pointwise Mutual Information (PMI). After that, the PMI was used as the weight of the corresponding edge to represent the structural information of the sentence. The semantic features fusing location information were taken as the initial features of nodes to increase the semantic features of nodes in the text graph. Experimental results on the microblog emotion classification dataset of Social Media Processing 2020 (SMP2020) show that for two sets of microblog data containing six categories of emotions: happiness, sadness, anger, fear, surprise, and emotionlessness, the average F1-score of the proposed model reaches 72.64% which is 2.75 and 3.87 percentage points higher than those of the BERT (Bidirectional Encoder Representations from Transformers) Graph Convolutional Network (BGCN) model and the Text Level Graph Neural Network (Text-Level-GNN) model, verifying that the proposed model can use the structural information of sentences more effectively to improve the classification performance than other deep learning models.

Table and Figures | Reference | Related Articles | Metrics
Data deduplication in Web information integration
LIU Xueqiong WU Gang DENG Houping
Journal of Computer Applications    2013, 33 (09): 2493-2496.   DOI: 10.11772/j.issn.1001-9081.2013.09.2493
Abstract582)      PDF (645KB)(403)       Save
Since traditional data dedupliation methods are of low time efficiency and detection accuracy, a Stepwise Clustering Data Elimination (SCDE) method was presented based on the features of Web information integration. Firstly the whole record set was divided into sub-sets using both key attributes division and the Canopy clustering technique, and then the similar records in each sub-set were accurately eliminated. A fuzzy entity matching strategy based on dynamic weight was proposed to accurately eliminate the duplicate records, which reduced the influence of missing attribute on record similarity calculation, and the name of company was especially treated to improve the matching accuracy. The results show that the method is superior to traditional algorithms in time efficiency and detection accuracy, and the precision is improved by 12.6%. The method is applied in forestry yellow page system and performs well.
Related Articles | Metrics